大规模并行处理器编程：实践导向：硬件排序难题

在高性能硬件中，速度就是生命。想象一下GPU执行 Z缓冲：它必须每秒对数百万个深度值进行排序，以确定哪个像素在前方。为了实现这一点，工程师依赖于 无符号数值比较器，一种精简的电路，能够从最高有效位（MSB）到最低有效位（LSB）处理比特，无需任何认知开销。

标准补码无法通过这种“傻硬件”的测试。因为负数的符号位为1，正数的符号位为0，所以像-1（111...）在按位比较时看起来比+1（001...）更大。这造成了一个 不连续性，迫使硬件使用复杂且更慢的条件逻辑来判断数值大小。

为了恢复效率，我们采用 偏移编码 （偏置表示）。通过调整数值范围，使最小可能值映射为 000... 而最大值映射为 111...，我们确保比特模式能唯一标识一个数值，其 字典序 与数值顺序完全一致。

这一特性使得“傻”硬件比较器可以即时处理“聪明”的浮点数据。

TERMINALbash — 80x24

> Ready. Click "Run" to execute.

QUESTION 1

What is the primary architectural advantage of using Excess Encoding for exponents?

It allows the use of simple, fast unsigned comparators.

It increases the total range of representable numbers.

It eliminates the need for a sign bit in the mantissa.

It prevents overflow in integer addition.

QUESTION 2

In a 3-bit Two's Complement system, which bit pattern appears 'larger' to an unsigned comparator: -1 or 0?

-1 (pattern 111)

0 (pattern 000)

They appear equal.

Neither, the comparator crashes.

QUESTION 3

What property is defined by 'as the bit pattern increases, the represented decimal value also increases'?

Associativity

Monotonicity

Normalized Representation

Truncation

QUESTION 4

Why is the sign-bit discontinuity a problem for GPUs?

It forces sorting to happen on the CPU instead.

It necessitates complex, multi-stage conditional logic for every comparison.

It makes depth values (Z-buffer) impossible to store.

It causes bit-flip errors in the LSB.

QUESTION 5

If we use Excess-127 for an 8-bit exponent, what bit pattern represents the value 0?

00000000

01111111

10000000

11111111